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Abstract 

Standard cosmology is based on the assumption that the universe is 
spatially homogeneous. However the consensus on a homogeneous matter 
structure, even on very large scales, has never been complete. The ad- 
vantage of correlation dimension calculations is that they enable one to 
obtain the transition scale to a homogeneous distribution, whereas other 
methods, such as those using the two-point correlation function, make 
it hard to exhibit the possible fractal properties of the Universe matter 
content. Our purpose is to calculate the correlation dimension D2, look- 
ing for a possible transition to homogeneity, which would imply D2 = 3. 
We apply the correlation integral method to the three dimensional sam- 
ple composed of 332,876 galaxies which we extract from the Fifth Data 
Release of the Sloan Digital Sky Survey. We analyze the raw data up to 
the scale dmax — 160 Mpc, assuming — 70 km s~^ Mpc~^ and consid- 
ering three cosmological models in order to test the model dependence of 
our method. Using volume limited samples for this range leaves us with 
about 20,000 galaxies. Applying our method to random maps helps us 
to calibrate our results. We obtain a correlation dimension of the galaxy 
distribution which seems to increase with scales up to 1^2 = 3 reached 
around 70 Mpc. The results of our analysis, performed on the largest 
volume limited sample which can be extracted from the SDSS catalog, 
are compatible with those formerly obtained by other authors. However, 
to get a more reUable description of the structures at various scales, we 
think it will be mandatory to use still larger catalogs than those which 
are currently available. 

1 Introduction 

Standard cosmology is based on the assumption that the Universe is spatially ho- 
mogeneous, at least on scales sufficiently large to justify its approximation by a 
Friedmann-Lemaitre-Robertson- Walker (FLRW) model. The high 

isotropy measured in the cosmic microwave background radiation (CMBR) is 
usually considered as a strong evidence in support of this hypothesis. 



The structures seen in galaxy catalogs - groups, clusters and super-clusters, 
distributed along voids, filaments and walls - are not viewed as contradicting 
this principle, as the common opinion is that the scales on which the universe 
is assumed to be homogeneous are much larger than those subtended by these 
structures. However, the consensus on a homogeneous feature of matter, even 
on very large scales, has never been complete (see, e.g., Pietronero, 1987). At 
least, the value of the transition scale from inhomogeneity to homogeneity needs 
to be tested (Celerier, 2000; Romano, 2007). 

One of our motivations for the present study comes from our interest in 
trying to distinguish the different scaling regimes that may exist in the galaxy 
distribution, though on different length scales. Our interest was first raised by 
the apparent discrepancy in the collection of results obtained from the analyses 
of data realized by a number of authors, which appeared to differ essentially, 
not only by the statistical methods employed, but mostly by the scales spanned 
by the studied samples. 

A little more than twenty years ago, three-dimensional galaxy catalogs have 
been made available. They supported the first findings of approximate self- 
similarity in the large-scale distribution of galaxies within prescribed scale in- 
tervals (Provenzale, 1991). 

It was claimed, from an analysis of the data of the CfA redshift survey cata- 
log, worked out with the correlation function method, that, at small separations, 
i.e., for a correlation length tq = 5h~^ Mpc, the galaxy distribution exhibits 
a fractal structure with dimension D2 ^ 1.2 (e.g., Davis and Peebles, 1983). 
Other methods applied to the same catalog, and, in particular, the correlation 
integral method (Coleman, Pietronero and Sanders, 1988) gave a fractal dimen- 
sion slightly larger, D2 ~ 1.3 to 1.5. Using a set of measurements from the 
Observatory of Nice, a value of 1^2 ~ 2 was found (Thieberger, Spiegel and 
Smith, 1990). Pietronero and his collaborators (Sylos Labini, Montuori and 
Pietronero, 1998) proceeded then to the analysis of all the currently available 
redshift surveys and claimed that the galaxy distribution exhibits a constant 
correlation dimension D2 ^ 2 up to scales of at least 150/i~^Mpc. These results 
have been widely discussed in review articles published in 1999 (Martinez, 1999; 
Wu, Lahav and Rees, 1999). 

It was also suggested that three scaling regimes may be discerned in the 
galaxy distribution. For the Perseus-Pisces redshift survey and a reanalysis of 
other published results, the values ran from D2 ~ 1.2 at small separations, i. 
e., up to 3.5/1"^ Mpc, to D2 ^ 2.2 on larger scales, with homogeneity seemingly 
reached at scales > 30h~^ Mpc (Guzzo et al., 1991). Murante et al. (1998) 
claimed that on the smallest scales the results are consistent with a distribution 
of density singularities, that in the intermediate range there is a scaling behavior 
suggestive of flat structures such as Zeldovitch (1970) favored and that on the 
largest scales, the data seem to indicate a homogeneous galaxy distribution. 

The last few years have seen a dramatic increase in the number and depth 
of galaxies with known redshift s. Therefore, many authors studied the galaxy 
clustering, using the matter power spectrum as an indicator (see, e. g., Tegmark 
et al., 2002; Tegmark et al., 2004; Sylos Labini et al., 2007; Percival et al., 2007). 
However, this method is not adapted to compute accurately correlation dimen- 
sions (Pietronero, 1987). Therefore, other authors calculated the correlation 
dimension D2 for different scales using various methods (see Jones et al., 2005, 
for a review). A comparison of all these results shows that this dimension is 
unambiguously scale- dependent and increases from values less than 2 at scales 
less than 10 Mpc (Martinez and Jones, 1990; Guzzo et al. 1991) to values 
approaching 3 at much larger scales (Pan and Coles, 2000; Hogg et al., 2005; 
Tikhonov, 2006). 
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However, it has to be admitted that these data may as yet not be adequate to 
clearly decide such an issue and that is why we wish to reexamine it in the light 
of one of the currently available wide catalogs. We use here the DR5 sample 
from the Sloan Digital Sky Survey (SDSS, 2006) to enable us to repeat older 
calculations and hope to obtain more reliable results. 

To test the influence of cosmological distortion, already discussed from a the- 
oretical point of view in (Spedalere and Schucking, 1980; Ribeiro, 1995; Celerier 
and Thieberger, 2001), we perform our calculations for three cosmological mod- 
els, the Euclidean Universe, The Einstein-de Sitter model and the ACDM model, 
and we compare the results. We also study the other possible bias or systematic 
errors. 

We devote the next Sec. [2] to a short reminder of the correlation integral 
method. In Sec. [3] we describe the sample we extract from the catalog to com- 
plete our study. The possible bias and systematic errors are discussed in Sec. IH 
before describing our analysis of the sample and giving our main results in Sec.[5l 
We test these results in Sec. [6] by comparing them to the analysis of a mock 
catalog completed with the same method. In Sec. [71 we give a discussion and 
our conclusion. 



2 The Correlation Integral Method. 

The advantage of correlation dimension calculations is that they enable one to 
obtain the transition scale to a homogeneous distribution, whereas the usual 
method (two-point correlation function calculation) makes it hard to exhibit 
the possible fractal properties of the Universe (Pietronero, 1987). 

Thieberger, Spiegel and Smith (1990) discussed the different approaches to 
calculate correlation dimensions. The two methods most commonly used are 
the correlation integral (Grassberger and Procaccia, 1983) and the F method 
used originally by Pietronero (Pietronero, 1987). That paper gives a comparison 
between the various methods using well-known fractal sets for which we are able 
to derive the exact dimension. We choose here to use, as a characterization of 
the of point set structures, the correlation integral (Grassberger and Procaccia, 
1983), defined as: 

C2{r) = — E E - - X, I), (1) 

where O is the Heaviside function. The inner summation is over the whole set 
of — 1 galaxies with coordinates Xj, j 7^ i, and the outer summation is over 
a subset of N' galaxies, taken as centers, with coordinates X^. By taking only 
the inner N' galaxies as centers we allow for the effect of sample finiteness, i. 
e., for the exclusion of the points close to the edges (see, e. g., Provenzale et 
al., 1997; Sylos Labini, Montuori and Pietronero, 1998). As it is mentioned 
in those references, there are other methods for taking the edges into account. 
They enable one to consider larger volumes, but one pays out by not knowing 
exactly what are the approximations involved in it. We will return to discuss 
these points in the data section. In Eq. ([T|) r goes from a small value to rmax- 
Since r^ax is not too large (typically 700 Mpc), light cone effects might be small 
enough to be ignored (Celerier and Thieberger, 2001). However, we will check 
this assumption by testing three different cosmological models in our subsequent 
calculations. 

We would like to stress that this characterization is also valid when the set 
is not fractal. Therefore it seems to us appropriate to use this approach in all 
cases of analyzing galaxies as point sets. 
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Now, we may interpret C2{r) as JV{r)/N where JV{r) is the average number 
of galaxies within a distance r of a typical galaxy in the set. As r goes to zero, 
C2 should also vanish and, for general distributions, we express this property as 
C2 oc YoT computational purposes it is more convenient to use the form: 

log(C2) = CONST. + D2 log(r). (2) 

The exponent D2 is called the correlation dimension and it is necessarily 
< 3 for an embedding space of dimension three. When D2 is not equal to 
the topological dimension 3 of space in our Universe, the distribution is called 
fractal and D2 is its correlation dimension (Mandelbrot, 1982). 

In Eq. ([1]) we have a double summation. In some special cases, e. g., when 
one has a pencil beam catalog, the inner region has to be very small and, in 
the worst case, it can happen that one has to use the distances from a mere 
single point. This of course results in a strong deterioration of the statistics. 
Fortunately, the SDSS catalogue is wide enough and we are not confronted with 
this problem here. 

3 The Sample Extracted from the SDSS Catalog 

The data we analyze in this paper are extracted from the fifth release of the 
Sloan Digital Sky Survey (SDSS, 2006). 

The Sloan Digital Sky Survey (York et al. 2000) is an ongoing imaging survey 
of approximately tt steradians of the sky in five band pass: ugriz (Fukugita et 
al. 1996; Gunn et al. 1998; Smith et al. 2002). Its aim is also to obtain 
spectroscopic spectra (Richards et al. 2002; Blanton et al. 2003) of about 
one million of the objects detected by photometric monitoring (Hogg et al. 
2001). The rms galaxy redshift errors are ~ 30 km s~^ and hence negligible 
for our purpose. The images are processed by automated pipelines which yield 
photometric calibrations (Lupton et al. 2001; Smith et al. 2002; Stoughton 
et al. 2002; Pier et al. 2003; Ivezic et al. 2004) allowing one to select the 
galaxies from images of stars and quasars (Eisenstein et al. 2001; Strauss et 
al. 2002). For each object, the position, several fiux varieties, morphological 
parameters and a provisional classification are given. The catalog also includes 
informational fiags on each pixel and object (Stoughton et al. 2002). 

The main galaxy sample (Strauss et al. 2002) focuses on galaxies with 
apparent magnitude brighter than m = 17.77 for which spectroscopic spectra are 
available. The sample we use for our study is extracted from this main galaxy 
sample, which contains 465,789 galaxies, from which we make the following 
eliminations: 

0.02 <z< 0.22 , m < 17.77 

z Warning = zW = 0, 

zStatus = zS > 2, 

zConfidence = zC > 0.96. 

The primary fiux measure used for galaxies in this catalog is the SDSS 
Petrosian magnitude, petroMag, which is a modified version of the quantity 
defined by Petrosian (1976). When atmospheric seeing is null, the Petrosian 
magnitude gives a constant fraction of a galaxy light regardless of distance or 
size (Blanton et al. 2001; Strauss et al. 2002). Since galaxies bright enough 
to be included in the SDSS spectroscopic sample have relatively high signal- 
to-noise ratio measurements of their Petrosian magnitude up to m = 20 or so 
(Stoughton et al. 2002), we choose to perform the data analysis reported below 
with the following determination of the apparent magnitude of each galaxy in 
our sample: m = petroMag - extinction (atmospheric absorption). 
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The zWarning flag includes several empirical tests to determine if the pro- 
vided galaxy redshift is reasonable. Of course, choosing zWarning = cuts 
several objects that may be of interest (low signal-to- noise objects for example). 
However, it is a conservative choice which allows us to keep in the sample only 
objects with accurate redshift determination. 

The zStatus flag indicates what is the current status of the redshift deter- 
mination for each object. Above zS = 2, the redshift is available in a consistent 
manner. 

The zConfidence flag is a measure of the statistical confidence level in the 
redshift measurements. 

A review of the SDSS imaging strategy and of the composition andproperties 
of the main galaxy sample can be found in Tegmark et al. (2004) ij. See also 
(SDSS, 2006) for a more thorough description of the catalog and an explanation 
of the above notations. 

After completing these eliminations, we are left with 332,876 galaxies which 
comprise the catalog with which we work. 

4 Bias and Systematic Errors 

4.1 Redshift-space Distortions 

Gravitation, which makes galaxies cluster, also causes them to move with respect 
to the Hubble flow. Their so-called peculiar velocities thus yield an anisotropy 
of their clustering in redshift space. Even if this effect implies that nonlinear 
corrections cannot be neglected at the rather small scales where structures viri- 
alize, it can be exactly modeled and accounted for on the large scales on which 
clustering is linear and which are of interest for us (Kaiser 1987). 

This effect has been thoroughly studied by Tegmark et al. (2004), using 
a sample of SDSS galaxies. They have shown that, although estimates of the 
redshift-space distortions are very sensitive to nonlinear effects, an estimate of 
the real-space matter power is not. They have checked that any scale-dependent 
statistical bias in their real-space matter power spectrum due to nonlinear red- 
shift distortions is smaller than a few percent for the scales under study and 
that the systematic errors associated with this effect are negligible in compar- 
ison with the statistical errors. We therefore rely on this result to ignore this 
effect in our analysis. 

Moreover, for large scales, where the linear Kaiser effect is at work, we 
have tested it on our results by considering three cosmological models with 
different distance-redshift relations. We show thus that these results are model- 
independent (see Sec. [5]) and therefore that we can neglect this effect in our 
analyses. 

4.2 Galaxy Bias 

The main aim of this work is to calculate the correlation dimension(s) of the 
galaxy distribution in view of possibly determining the transition from inho- 
mogeneity to homogeneity which should appear at scales of order (some?) 100 
Mpc. 

A more detailed analysis of the overall clustering amplitude is also potentially 
interesting, e. g., to constrain cosmological models, but it is beyond the scope of 
our study. This normalization is known to be a strong function of both galaxy 
color and luminosity (Norberg et al.2001, 2002; Zehavi et al. 2002). Therefore, 

-•^ Please note the difference between our notation and Tegmark et al.'s who call r the 
apparent magnitude we denote as m. 
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Figure 1: The histogram of RA. 



an understanding of galaxy bias is needed to derive cosmological constraints 
from this statistic. 

On the smallest scales, where non linear corrections to the matter power 
spectrum are large, the shapes of galaxy power spectra are known to depend on 
galaxy color and luminosity (e. g.. Cole et al. 2005). On larger scales, which 
are of interest for us, these effects are more uncertain. We therefore choose to 
ignore galaxy bias in our calculations. 

4.3 Systematic Errors in the Data 

Now, we must contemplate the possibility that systematic errors occur in the 
data. Such effects include radial modulations of the density field due to mises- 
timates of evolution or K-corrections and angular modulations due to miscor- 
rected dust extinction, variable observing conditions, photometric calibration 
errors or fiber collisions. 

They have been analyzed by Tegmark et al. (2004) who showed that their 
effect on the estimate of the matter power spectrum is negligible. We therefore 
ignore these systematics in our study. 

Moreover, since we are only interested in large scale behavior, the problems 
whose effects are limited to small scales, such as fiber collisions, that reduce the 
survey ability to measure redshifts for very close pairs (below 55 arcsec), are 
irrelevant in our case. 

4.4 Sample Incompleteness 

The parameters available in the catalog to locate each object are the right 
ascension, RA, the declination, DEC, and the redshift, z. 

However, another source for possible inaccuracies in the results of our anal- 
ysis is sample incompleteness. To deal with this issue, and select the most 
possible complete sample among the previously selected data, we proceed as 
follows. 

First, we map the histograms of these three variables. Figure [1] shows, as an 



example, the histogram of RA. 

After examining the three histograms, we choose: 
115 <RA< 250 
25 < DEC < 60 
0.04 < z < 0.16 
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so that the remaining sample looks the most complete in this three-dimensional 
space. 

5 Analysis of the Galaxy Sample 

Here we assume Hq = 70 km s~^ Mpc~^. Note that the exact value retained 
for Hq is not too important since another value would only have as an effect to 
shift the transition scale the ratio of both values. 

We have already noticed that the fact that the minimum of z is sufficiently 
large makes the problem of redshift distortion for close galaxies irrelevant. Since 
Celerier and Thieberger (2001) demonstrated that for a distance, rmax, to 
at least over 600 Mpc, relativistic corrections to the EucHdean distance ap- 
proximation are negligible for the calculations performed here, we might also 
expect that at large z, which are also in our case quite limited, redshift distor- 
tion should not be too significant. However, we want to check this point and, 
for this purpose, we perform our calculations for three different cosmological 
models. We then compare the results to see if we find a significant difference 
between the three results, i.e., much larger than our statistical errors. 

The three models we put to the test are: the Euclidean, Einstein-de Sitter 
and ACDM models. Their luminosity distances as a function of redshift are: 

The Euclidean (or redshift) distance 

De = §- (3) 

The Einstein-de Sitter luminosity distance (Celerier and Thieberger, 
2001) 

DEdS = ^{l^z-VTT^) (4) 
The ACDM luminosity distance (Celerier, 2000) 

Dacdm = -^z + 7^(2 - + 2nA)z^ + ^Fsi^M, ^a) 

Uq 4ilo olio 

Fs{X, Y) = {-2X -AY- 4XY + + 4Y'^)z^ + C>(z^) (5) 
with Qm = 0.25 and Oa = 0.75, it reads 

Dacdm = -^z{l + 0.8125^ - 0.2422^^) (6) 

To obtain the absolute magnitude from the measured apparent magnitude 
displayed in the data, we use 

M = m - 5 logio Dj-K- 25 (7) 

Here Dj is in Mpc and j denotes each of our three models: j = EdS, ACDM. 
According to Surendran (2004), a reasonable approximation for the average 
K-correction applicable to the Sloan catalog is 

K=_2.5(l + ce)logio(l + ^), (8) 

with a = — 0.5. Therefore, we limit our sample to those galaxies that 
correspond to: 

Miim = 17.77 - 25 + 1.251ogio(l + Zmax) -^log^oi^max)- (9) 

Then, we examine the distribution function of the different input parameters. 
This fixes the range of the domain in which we perform our calculations. This 
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Figure 2: The histogram of X,Y in the Euchdean case. In blue, the initial 
domain. In red, the final sample. 



limiting range, dmaxi is the distance from the inner region to the outer region. 
We also determine, for each model, the maximum distance, Vmaxi from our 
galaxy to the most distant in our sample. 

After having turned the RA, DEC, z coordinates into Cartesian coordinates 
for each object and for each model, we make histograms of these coordinates, 
X, y, Z, to obtain the final limits on our samples. These histograms yield ap- 
proximate ranges. Combining this information with the number of inner and 
outer galaxies in a number of ranges, extracted from the domain representing 
our initial sample, and choosing the one with the highest number of galaxies, 
we obtain the best choice of the final sample to which we apply our analysis. 

We start with the Euclidean case. 

As an example we give in Fig. [2j the X, Y domain, for this model. 
The histograms yield the following ranges (in Mpc): 

-450 <X < -300, < r < 80, -320 <Z < -120, 

and rmax=685.6 Mpc comes from Eq. Q where the redshift z is set to its 
maximum value, 0.16. 

As a result our sample is reduced to 21,488 galaxies. We fix also, as explained 
in Sec. O an inner zone of 742 galaxies. 

Then we proceed as follows. We divide the range dmax into 256 segments. 
We apply the Grassberger-Procaccia method to each segment and obtain the 
dimension D2 for each of them. We take three consecutive segment ends and 
perform on them a least square calculation of 1^2 • Then we take the last of these 
three and add two more points for the next least square calculation. Therefore 
we end up with 128 different dimensions. The final correlation dimensions are 
calculated by taking a consistent average D2 on each range where this is feasible, 
which allows us to obtain constant dimensions on rather large scale ranges. By 
completing such an averaging procedure we are also able to calculate the root 
mean square deviation, and so to obtain a reasonable error measure. We find 
impossible to derive a robust constant dimension below 52 Mpc, but in the range 
52 to 124 Mpc we obtain: D2 = 2.98 ± 0.02. Below 52 Mpc, D2 increases from 
about 1 to 3. Above 124 Mpc, the infiuence of the edge effects starts to bring 
into play, so we have to wait for larger catalogs to go beyond this scale. The 
results for this Euclidean case are plotted in Fig. [3l 
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Figure 3: The Euclidean case: D2 as a function of the scale r, obtained by least 
square fits for three points at a time. 

Table 1: D2 values obtained for different numbers of domain divisions, Nbin- 



Nbin 


64 


256 


1024 


D2 


2.95 ±0.02 


2.98 ±0.02 


2.9 ±0.2 



They are in a quite good agreement with that obtained by Hogg et al. (2005) 
for the SDSS luminous red galaxies. 

Some authors (e. g., Tikhonov, 2006) used the F, spherical layers method, 
which was first introduced by Pietronero (1987). A discussion concerning this 
approach can be found in (Thieberger, Spiegel and Smith, 1990). We found 
no advantage to use this method instead of the correlation integral. Still, we 
thought interesting to check with it our results for various domain division 
choices. In Table 1 we show the D2 values obtained for coarser and finer divi- 
sions. We see that going to a finer division does not improve the results. We 
conclude that this might be an additional confirmation that the integral method 
is good enough. 

We now turn to the Einstein-de Sitter model. 

Here, the high density region remains roughly the same as in the Euclidean 
case. Therefore, for comparison sake, we choose the range: 

-450 <X < -300, < r < 80, -340 <Z < -140, 

and obtain a sample reduced to 19,027 galaxies, for which the inner region 
exhibits 629 galaxies. We obtain r^aa;=711.03 Mpc from Eq. (|4]) where z is set 
to its maximum value, 0.16. The average value of the correlation dimension 
in the range 52 to 124 Mpc (just as in the Euclidean model) comes out to be 
D2 = 2.95 ±0.02. 

We notice that the results are rather similar than for the previous cosmolog- 
ical model. They are given in FigHl It is worth stressing that the transition to 
homogeneity seems less complete in this case than in the Euclidean model (the 
fractal value remains somewhat under 3), but the difference is quite minor. 

Now we consider the last case, the ACDM model. 

The preliminary results look less good. The edge effects start earlier and so 
at dmax = 160 Mpc the error on D2 becomes very large. Therefore, we limit 
ourselves to a smaller region, with dmax=^'^^ Mpc. In this case the range taken 
into consideration is: 



9 



20 40 60 so 100 120 140 160 

r(in MFC) 



Figure 4: The Einstein-de Sitter model: D2 as a function of the scale r, obtained 
by least square fits for three points at a time. 




r (in Mpc) 

Figure 5: The ACDM model: D2 as a function of the scale r, obtained by least 
square fits for three points at a time. 



-400 <X < -300, < r < 80, -280 <Z < -130, 

Under these conditions we get again very similar results, but the errors 
outside the chosen range are much larger. The sample is here reduced to 11,698 
galaxies with 390 galaxies in the inner region. From Eq. ([5|), the value for Vmax 
reduces to 676.3 Mpc and between 45 Mpc and 104 Mpc we obtain: D2 = 2.97 ± 
0.02. The analysis results are shown in Fig|5l We notice that the fluctuations 
are larger than in the previous cases, especially at small scales. 

However, we can infer from the similarity of the results obtained with these 
three very different cosmological models that our method is rather model- 
independent. This strengthens the accuracy of our choice to neglect redshift- 
space distortions to perform this analysis. 
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Figure 6: The small random map: D2 as a function of the scale r, obtained by 
least square fits for three points at a time. 

6 Check of the Method: Apphcation to a Ran- 
dom Map Catalog 

To check the robustness of our method we apply it now to a random map. 

We prepare a mock catalog where the "galaxies" are randomly distributed 
within the limits set by the real catalog. Since we have seen that our method 
is almost model-independent, we choose the EdS model as the prototype for 
our random map. We calculate the absolute magnitude, M, of each object from 
Eq. ([7j) where the expression for the luminosity distance,, Dj^ is given by Eq. (|4|). 
This enables us to build a histogram of the M values. We then fit the histogram 
to some function, which we use to obtain random M values distributed just as 
the M's pertaining to the EdS model. The redshift and angles determining the 
sample are chosen randomly: 

RA rdmi {RAmax - RAmin) + RAmin (10) 



DEC = rdm2{DECmax - DECmin) + DECmin (11) 



z = Cy rdm\ + rdm\ + rdm\ (12) 

Here, rdrrii denotes random numbers distributed between and 1, and C is 
chosen so as to correspond to the distribution of the observed redshifts. We use 
a random number generator so that we obtain the same number of "galaxies" 
as in the volume limited catalog. We consider the same dmax and use the same 
calculation method we apply to this mock catalog. 

The plot of the correlation dimension D2 as a function of scales for this 
random map is displayed in FiglHl 

We see that in the small r region this dimension is increasing faster than 
for the observed catalog analyzed in the framework of a similar EdS model. 
Actually, 1^2 = 3 is reached around 35 Mpc. For the range 35 to 100 Mpc, we 
obtain D2 = 2.96 ±0.01. 

Since the correlation dimension we should obtain at all scales for a randomly 
distributed point set in a 3-dimensional volume is 1^2 = 3, we suspect this small 
scale anomaly might be due to the sparse population of these scales. Thus, we 
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Figure 7: The large random map: D2 as a function of the scale r, obtained by 
least square fits for three points at a time. 

repeat the calculation for a point number which is four times as large. The 
results are given in Fig|7l 

Because we have more points, the dimension 3 is reached at lower scale. At 
the large scale limit, the influence of edge effects implies there is no significant 
change. Here we obtain, for the range 20Mpc to lOOMpc, D2 = 2.97 ± 0.01. 

We therefore conclude that we must be very cautious about what we can 
infer from such an analysis at scales below 20-30 Mpc. 

7 Discussion and Conclusion 

We have used the publicly available data from the DR5 SDSS catalog to com- 
plete an analysis of the dimension D2 of the galaxy distribution with the cor- 
relation integral method. Actually, there exists another method well designed 
to deal with the edge effect and described in (Coleman and Pietronero,1992). 
But although our method results in having a smaller value, d^ax^ for the larger 
probed scales, it seems safer as less approximations are involved (Provenzale et 
al., 1997). 

To check the sensitivity of this method as regards redshift-space distortion 
or light cone effects, we performed our analysis in the framework of three dif- 
ferent cosmological models. It came out that the dimension D2 as a function 
of scales obtained in these different cases exhibits almost the same behavior, 
strengthening therefore our feeling that these effects could be safely ignored. 

We performed another check of the robustness of the method by comparing 
the analysis of the observational catalog with that of random maps. We could 
thus see that at small scales, the random catalogs exhibit a fast increase of D2 = 
toward the value 3 pertaining to a homogeneous distribution in a 3-dimensional 
volume, much steeper than the increase obtained with the real catalog. However, 
since 1^2 = 3 is only reached around some 20-30 Mpc with the mock catalogs, 
we stress we must be cautious about what we can infer from such an analysis 
at scales below these values. Moreover, we can conclude from such results that, 
to obtain a more reliable description of the cosmological structures at various 
scales, it will be mandatory to use the larger samples which will be available in 
the future. 

Meanwhile, the application of our method to the galaxy sets we extracted 
from the SDSS DR5 main sample yielded a correlation dimension D2 increasing 
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from very small values up to D2 = 3 around 70-80 Mpc. This is compatible 
with the results obtained by Hogg et al. (2005) for luminous red galaxies and 
by Tikhonov (2006) for the DR4 SDSS Main Galaxy Sample. 

In the smaller scale range where our results can be considered as rather 
robust, i. e., for 20 Mpc < r < 70 Mpc, we noticed a slow increase from D2 = 2 
to D2 = 3. This could be the signature of the galaxy distribution not being 
monofractal (or self similar) in this range, i. e., the correlation dimension is not 
a constant but varies with scale. A discussion of this issue, applied to the galaxy 
distribution, can be found in McCauley (2002). Such a property was expected 
from previous results obtained by, e. g., Murante et al. (1998). 

It might also explain the discrepancies between the results of previous anal- 
yses performed with different samples probing different or too large scale ranges 
and for which self similarity was posed as an a priori assumption (Davis and 
Peebles, 1983; Coleman, Pietronero and Sanders, 1988; Sylos Labini, Montuori 
and Pietronero, 1998; Wu, Lahav and Rees, 1999) . Actually, if the correlation 
dimension varies with scales, the monofractal hypothesis implies that the results 
obtained by these various works must be different. 

Now, one must be aware that what we have examined here is the distribution 
of bright matter only. In standard cosmology, one usually admits that most of 
the baryonic matter is present under the form of "dark matter" , which, since it 
emits no light, cannot be seen in surveys such as the SDSS. We therefore claim 
that our results apply to bright matter only and that their extension to the 
whole baryonic matter content of the Universe would be unwise without further 
proof that "light traces mass" . 
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